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ASYMPTOTIC INFERENCE IN SOME HETEROSCEDASTIC 
REGRESSION MODELS WITH LONG MEMORY DESIGN 

AND ERRORS 

By Hongwen Guo and Hira L. Koul 

Michigan State University 

This paper discusses asymptotic distributions of various estima- 
tors of the underlying parameters in some regression models with 
long memory (LM) Gaussian design and nonparametric heteroscedas- 
tic LM moving average errors. In the simple linear regression model, 
the first-order asymptotic distribution of the least square estimator 
of the slope parameter is observed to be degenerate. However, in 
the second order, this estimator is n 1//2 -consistent and asymptoti- 
cally normal for h + H < 3/2; nonnormal otherwise, where h and H 
are LM parameters of design and error processes, respectively. The 
finite-dimensional asymptotic distributions of a class of kernel type 
estimators of the conditional variance function o~ 2 (x) in a more gen- 
eral heteroscedastic regression model are found to be normal when- 
ever H < (1 + h)/2, and non-normal otherwise. In addition, in this 
general model, log(n)-consistency of the local Whittle estimator of H 
based on pseudo residuals and consistency of a cross validation type 
estimator of o~ 2 {x) are established. All of these findings are then used 
to propose a lack-of-fit test of a parametric regression model, with an 
application to some currency exchange rate data which exhibit LM. 

1. Introduction. This paper discusses asymptotic distributions of some 
estimators of the underlying parameters in some heteroscedastic regression 
models with LM in design and errors. This is of interest partly for the 
purpose of regression model diagnostics and partly for the sake of some 
large sample inference in these models. Regression models with LM in both 
design and error variables are useful when LM in the given response variable 
is not fully explained by LM in the given design variable; see [26]. 

For the sake of clarity of exposition, we first focus on a simple linear 
regression model where one observes a strictly stationary bivariate process 
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(X t ,Y t ), t G Z := {0, ±1, . . .}, both having finite and positive variances and 
obeying the model 

(1.1) Y t = 0o + (3iX t + a(X t )u t , for some (J3q, 0i) G M 2 , 



(1.2) 



3=0 



as j — > oo, for some i < i? < 1 



Here, et are standardized i.i.d. r.v.'s, independent of the A^-process and the 
constant C is such that YlfLo &| = 1- Under this set up, a 2 (x) = Var(Yo| Ao = 
x), i£K, and Ea 2 (X ) < oo. 

For a stationary second-order process £t, teZ, let (7^) denote its spec- 
tral density (auto-covariance function). We also assume that {X t } is a Gaus- 
sian process with mean //, variance 7 2 := 7x(0), and 

(1.3) 7x(&) ~ ^2(1-/1) as — > 00, for some 1/2 < ft, < 1, 

where 0(/i) := 2r(2 — 2/i) cos(-7r(l — h)) and Gx > is a constant. The se- 
quence bj is also assumed to satisfy bj — > 0, as j — > 00 and for some a < 00, 
frj+l < &j(l +j~ ft), for all sufficiently large j. This condition, for example, is 
satisfied by FARIMA(0, H - 1/2, 0) model where = T(j + H - 1 /2) /r(j + 
l)r(if — 1/2). As pointed out in [24], page 1632, under this condition, 

f u (X) ~ GuX 1 ' 211 , A^0+; 

(1.4) 

7u(fc) ~ G u 9(H)k~ 2 ^~ H \ k^oo, 

where G u is a positive constant. 

Several authors have discussed regression models with LM errors when 
a{x) = c, a constant. The asymptotic distributions of the least squares esti- 
mator (LSE) and M- and R-estimators in nonrandom design linear regres- 
sion models with LM errors are established in [12, 17, 18, 33, 34] and for 
nonlinear regression models in [19]. The asymptotic distribution of the gen- 
eralized LSE (GLSE) in certain polynomial regression models is discussed 
in [6] and [16]. The errors in [6] are assumed to be LM Gaussian, while in 
[16] a function of a long memory moving average (LMMA) process. 

In the context of homoscedastic multiple linear regression models with LM 
in both covariates and errors and when the error process has a known para- 
metric spectral density, the GLSE of the slope parameter vector is known 
to be n 1 / 2 -consistent and asymptotically normal with the Gauss-Markov 
variance; see [26]. This result is adapted in [15] to the case where the error 
spectral density is semi-parametric as in (1.4). A crucial result needed here 
is the availability of a preliminary n 1 / 2 -consistent estimator of the slope pa- 
rameter vector. In [26] it was also noted that the LSE is ra 1//2 -consistent for 
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certain values of H, h. In a simulation study reported in [15] it was found 
that the adaptive estimator where LSE was used as a preliminary estimator 
had smaller MSE compared to the one where GLSE was used. This partly 
motivates the need to understand asymptotic behavior of the LSE in the 
current set up, for were one to carry out an analogous adaptation program 
here, even to get started one would need a preliminary ra 1//2 -consistent esti- 
mator of [3\ in order to estimate cr(x) and / U (A) nonparametrically. And at 
least for those values of H and h for which the LSE is n 1 / 2 -consistent, its use 
in constructing such adaptive estimators would be justified. Other reasons 
are to be able to estimate G u and H and to understand asymptotic behavior 
of some lack-of-fit tests for fitting a heteroscedastic linear regression model. 
Currently there is a void on this topic in the literature which this paper is 
attempting to fill. Because of its simplicity, it is desirable to use the LSE for 
these purposes. 

Section 2 discusses asymptotic distribution of the LSE (3 := ($o,$i)' of 
P ■= (/3 , fa)' in the models (1.1) and (1.2). The weak limit of n l ~ H {(3 - p) 
is shown to be a bivariate normal distribution, for all 1/2 < H, h < 1. But if 
H = EX = and cr(x) is an even function, then this asymptotic distribution 
of /3i is degenerate. In this case we further obtain that if H + h < 3/2, 
n l l 2 (f3\ — Pi) converges weakly to a normal r.v. On the other hand, in the 
case H Ah:= mm(H, h) > 3/4, and even when both m and Xt are Gaussian, 
Pi has a nonnormal weak limit with the normalization n 2 ~ H ~ h . 

To implement the proposed lack-of-fit test for fitting a regression model 
or to carry out some inference about P and cr 2 (x), one needs consistent 
estimators of cr(x), G u and a ln(n)-consistent estimator of H. Section 3 
derives asymptotic distributions of a class of kernel type estimators cr 2 (x) 
of cr 2 (x) in the regression model 

(1.5) Y t = p , r(X t ) + a(X t )u t , PeR q , 

where r(x) is a vector of some known q functions and the rest of the 
entities are as in (1.2) and (1.3). It is proved that when H < (1 + h)/2, 
the finite-dimensional distributions of n 1 ~ h (a 2 — a 2 ) converge weakly to k- 
variate normal distributions, while for H > (1 + /i)/2, the weak limit of 
n 2 ~ 2H (a 2 (x) — a 2 (x)) is nonnormal. 

Using the approach in [25], the local Whittle estimator of H based on 
the pseudo residuals Y t — P'r(X t ) in the model (1.5) is shown to be log(n)- 
consistent, where P is the LSE. This is unlike the case of nonparametric 
heteroscedastic regression model with X% = t/n, 1 < t < n, and LMMA er- 
rors, where it is necessary to base estimators of H on the standardized 
residuals; see [11]. 

An important inference problem is to assess the accuracy of an assumed 
regression model. Let (X, Y) denote a copy of (Xq,Yq) and n(x) := E(Y\X = 
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x). Consider the problem of testing Hq : fi(x) = (3'r{x), for some (3 £ R 9 and 
for all x 6 R, against the alternative that Hq is not true. In the 1990's, several 
authors found that tests of Hq based on the marked empirical process 

n 

Vn{x) = ]T (Y t - P'r(X t ))I(X t <x), x€R:= [-00,00], 
t=i 

have desirable level and power properties against a broad class of alterna- 
tives; see, for example, [[1], [21], [29], [30], [31]], among others. See [1], pages 
132-134 of [13], and [29] for more motivation about using this process for 
lack-of-fit testing. In the presence of long memory in design and/or errors 
and when cr(x) = c, some tests based on this process have been studied in 
[20]. 

Under the current set up, Theorem 5.1 below proves that, under Hq, 
n~~ H V n (x) converges weakly to J a {x)ipiZ , in T>(M.) and uniform metric, 
where Z is a N(0,1) r.v., ^ := G U 6(H)/H(2H - 1), J a (x) := E[a{X) - 
E(a(X)r(X))'A- 1 r(X)] x I(X < x), and A" 1 is assumed to exist. Thus, to 
use V n for testing Hq, we need a uniformly consistent estimator of J a (x) 
and a consistent estimator of A uniformly consistent estimator of J a , 
under Hq, based on the leave-one-observation-out estimator of cr(x), is given 
in Section 5. The regular kernel type estimator is not useful here because of 
the unstable behavior of a 2 (Xt). The estimators of G u and H constructed 
in Section 4 are used to provide a consistent estimator of ipi under Hq. 

Section 6 includes a finite sample simulation and an application to some 
monthly currency exchange rate data that exhibits long memory. The last 
section is the Appendix consisting of some proofs and necessary lemmas. 

In this paper all limits are taken as n — > 00, unless specified otherwise. 
For any two sequences of real numbers, a n ~ b n means that a n /b n — ► 1, — >d 
stands for the convergence in distribution of a sequence of r.v.'s, while 
denotes the weak convergence of a sequence of stochastic processes, and u p (l) 
denotes a sequence of stochastic processes that tends to zero uniformly over 
its time domain, in probability. Henceforth, the independence of X% and ut 
processes is used without mention. 

2. Asymptotics of the LSE. This section discusses asymptotic distribu- 
tion of the LSE in the model (1.1)— (1.3). For this purpose, we need the 
following result. Let v be a real valued function on M with Eu 2 (X) < 00. 
Set vq := Ev(X). By (A. 20) below, there is a C < 00 such that \E(u(X ) - 
vo)uo(u(Xt) — vq)im\ < Ct~ A+2H+2h , for all sufficiently large and positive t. 
Hence, Vl/2 < h,H < 1, 

n n n 

n~ H ^v(Xi)ui = VQn~ H ^Ui + n~ H ^{v(Xi) - u )ui 
i=i i=i i=i 
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= u n H ^Ui + o p (l). 
i=l 

Next, let D{a) := 8(a)/a(2a — 1), 1/2 < a < 1, and Z\,Z 2 be two indepen- 
dent r.v.'s, 2,- having 7V(0,^j) distribution, j = 1,2, where = G U D(H), 
i)\ = G x D{h). Let a := £cxpf), J := £X(t(X), 7^ := (J - fia ) = 
Cov(X,a(X)) and To := oo — [(^ — ^ a o)^h 2 ]- From [8], we obtain 

n n 

(2.2) Z nl :=rr H 5>i4z!, ^^(1, -/i)/ 7 4z 2 . 

i=l i=l 

Now, let <T t := a(X t ), e t := a t u t , nX :=_J2t=i x t, nu := J2t=i u t, n ^ '■= 
J2t=i Y u ne := £?=i e t , ns 2 := T,t=i( X t ~ x ) 2 - Recall that the LSE satisfies 

(2.3) h-Px = ^{n- 1 Y^X t e t -X^ fa - fa := e - X0i - Pi). 

From (2.1) and (2.2) applied to u(x) = a(x) we obtain that e = O p (n^~ 1 ), 
X-n = O p {n h ~ l ). These facts and (2.3) yield that n 1 ' 11 (Po~Po) = F Z nl + 
o p (l), n l ~ H ((3i — Pi) = T\Z n i + o p (l) and hence, the following: 

Lemma 2.1. Under (1.1)-(1.3), n 1 ~ H o - (3 Ji - Pi) (r ,ri)Zi. 

But Li = if either cr(x) = c, a constant or /i = and cr(x) is an even 
function of x. Since in these cases the weak limit of n l ~ H (Pi — P\) is de- 
generate at zero, it is pertinent to investigate higher-order approximation to 
the distribution of P\. The former case has been discussed in [20]. We shall 
next discuss the second-order result under the following: 

Assumption 1. a(x) is an even function of x £ E and [i := E(X) = 0. 

Let a{z) := /* u^ 2 (l - u) 1 ' 2 * du, 1/2 < z < 1, C : = (^gfey) 1/2 and 

(2.4) Z 2 := C j £ [(s - Xl )-^- 2H ^ 2 (s - x 2 )-^~ 2h y 2 } 

x I{x\ < s,X2 < s) ds dBi(x\) dB 2 (x 2 ), 

where B\ and B 2 are the two Wiener random measures; see [32]. We also 
need to define Z n2 := n 1 """* £ t re =1 X t u t / 7 , Z n2 := n~ h £™ = i *i/7> and a := 
E(X 2 a(X)). We are now ready to state and prove the following: 
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Lemma 2.2. Suppose (1.1)— (1.3) and Assumption 1 hold. Then, 

n 

(2.5) n x - u ~ h E Xtatut = lc x Z n2 + o p (l), 

t=i 

(2.6) n 2 - H - h Xe = ja Z nl Z n2 + o p {l) Vl/2 < h,H < 1. 
Moreover, for H + h> 3/2, Correl(i? n 2, Z n \Z n -i) converges to 



v/2(2if + 2h - 3){2H + 2h-2) / Hh 



(2H + 2h-l) y (2#- l)(2/i- 1)' 

Proof. Let -f/j denote the jth Hermite polynomial, j > 1; see, for ex- 
ample, [32]. The Hermite expansion of the function xa{x) is J2j^=o~/\Hj(x), 
where Cj := E(Xa(X)Hj(X)),j > 0. Since under Assumption 1, Co = 0, 
c\ = EX 2 o~{X) ^ 0, the Hermite rank min{j > l;cj ^ 0} of xa(x) is 1. 
Hence, in L 2 , 

n n n oo 

(2.8) n" 1 ]T X t a t u t = - E*t«t + n^ 1 E«t E ^ H j( X t) =■ S n + T n . 

t=l 71 t=l t=l j=2 

Using the independence of X^s and tt,'s, we obtain V&r S n = 0(n~ A+2H+2h ). 
Because of the orthogonality of the Hermite polynomials, 

n n ( oo oo ^ 

Var(T n ) = n~ 2 EE Eu s u t E E^^E JjH k (X t ) 

s=lt=l lj=2 k=2 ) 

n n 

\2H-2\ n .\4h-A 



=li=l 



< Cn- 6+2H+4h Inn = o(Var S n ). 

This fact and (2.8) readily yield (2.5). The claim (2.6) is proved similarly, 
using the fact that under Assumption 1, the Hermite rank of o~(x) is 2. 
To prove (2.7), let m = G x G u 9{H)6(h). By (2.1) and (2.8), 

, \ , / " , \ 2c 2 K X n 2H + 2h -^ 

Var > Xtet/n ~ Ci Var > Xtu+ n ~ — -, 

1 j 1 ' j (2H + 2h-3)(2H + 2h-2)' 

n 2 2H+2h-4 

Next, by the Hermite expansion of xa{x) and o~(x), we obtain 



n n n 



E [ n ~ 1 E X ^ X ^ ~ cidon" 3 E E E E{X t X s )E{u t u k ) 

\ t=l ) t=\s=\k=\ 
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-3+2#+2/i-4 



n n n 

EEE 

f=l 8 =1 k = l 

/!„„_. „2ff+2/i-4 

(2H - l)(2h - 1)(2H + 2/i-l) 



f 


s 


2ff-2 


/ 




2h 


n 


/; 
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This proves (2.7). □ 



The following theorem gives a nonstandard second-order limiting distri- 
bution of /3i when H A h > 3/4, where i% is as in (2.4) with B\ and £>2 
independent. 

Theorem 2.1. Suppose (1.1)-(1.3) and Assumption 1 hold, with ej's 
being N(0, 1) r.v.'s. Then, forhAH> 3/4, 

(2.9) n 2 - H - h {fi x - /3i) -+ d 7 _1 [ci^2 - <r ZiZ 2 ]. 
Proof. Lemma 2.2 combined with (2.3) yields that 

n 2-H-h0 i _ = l[ ClZn2 _ ao Z nl Z n2 ] + 0p (l) Vl/2 <h,H<l. 

7 

Using the derivations similar to those in the proof of Theorems 6.1 and 6.2 
in [10], one verifies that (Z n 2,Z n \,Z n 2) — >d (Z2, Z\, Z2). Upon identifying 
D\,D2 there with 2(1 — /i),2(l — H), respectively, one sees the condition 
< D\,D2 < 1/2 is equivalent to h A H > 3/4. These facts together with 
(2.3) complete the proof of (2.9). □ 

Consistent estimates of c\,ao and 7 are Ya=i X\ fVi(Xi)/n, J2?=iVi{Xi)/n 
and s, respectively, where V^'s are defined in (5.2) below. However, the dis- 
tribution of the limiting r.v. in (2.9) is not easy to determine and, hence, 
any decent inference about f3\ based on (3\ appears to be infeasible in this 
case. 

We shall next discuss asymptotic distribution of (3\ when ttj's form the 
moving average (1.2) and H + h < 3/2. This in turn is facilitated by the 
following lemma where U n := n~ l l 2 Ylt=i u {Xt)ut and Ju(k) = Euqu^. 

Lemma 2.3. Suppose (1.1)-(1.3) holds. In addition, suppose v is a mea- 
surable junction such that Ev(X) = 0, Ev 2 (X) < 00, the Hermite rank of 
v{X) is 1, and 

(2.10) max |^(x)|/?i 1/2_r? -> 0, for some < rj < 1/2. 

{0<x<lnn} 

Then, for H + h < 3/2, U n ^ d N(0,k 2 ), where k| = lim n ^ 00 £'L^ = 
7^(0)7u(0) + 21im 

n— >oo 
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Proof. The proof uses the truncation method similar to the one used 

in [26]. The main idea here is to approximate U n by a weighted partial sum 

of the i.i.d. r.v.'s {£«}. Fix H, h such that H + h < 3/2. Let v% := v(Xt) and 
M = M n > n (2/^i)/(2-2H) 5 and define 

n n 

U n ,M ■■=rT 1/2 Y^v t ^2 b t-j e j- 
t=l j=-M 

Because the Hermite rank of vt is 1, by (A. 20) below, 

-Af-i / n \ 2 

E{U n -U nM ?=rr l Y. E [J2 u tbt- j ) 

j=-oo \t=l J 

(2.11) 

Hence, it suffices to show that (a) EU 2 M — ► k 2 and (b) U n ,M — >dN(0, K 2 ). 

Let J 7 := cj-{X t ,t > 1}. The claim (a) is implied by E(JJ^ M \T) — > v k 2 . 

Let d nJ := -±^J2t=i "th-j ■ Then, U n>M = T,j=-M d n,j £ j and e ( u I,m\?) = 

E"=-M<r Recall 7«(fc) =^«o«fc = E^o&A-+fc- Rewrite £(C^ M |^) = 
A + 2B, where 

n n n n— 1 n 

A = n _1 J2 u t b t-j, B = n~ 1 E E Wsh-jbs-j. 

j=-Mt=l j=-M s=lt=s+l 

But A = A\ + A 2 , where 

n n 

M = n~ l J2 E(^->(0))^, 
j=-Mt=l 

n / oo oo \ 

= n- i ^(^- >( o))K:^- e «po, 

t=l \fe=0 k=t+M+l ) 



A 2 := ll/ (0)n- 1 E 6 ? 



n n 

2 

i 

j=-Mt=l 



n / oo 



i=l \fc=0 k=t+M+l / 

because Er=Af - 0, E?=i(^ - 7 „(0))/n - and E?=i K 2 - 7,(0)|/n - 
C < oo, a.s., by the Ergodic Theorem. Also, sup n i^|Ai| < 0^(0) < oo. 
Hence, E^] -> and J3|>1 - 7„(0)7 tt (0)| -> 0. 

Next, let := n _1 ElLi u t u t+j- Then, one can rewrite B = B\ — B 2 , 
where J3i :=Y2=i9klu{k) and £ 2 := E£=r 9k Y,?=k + M+l b k+j b y B y ( 2 - n )' 
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EB\ — ► 0. For the term B\, we have 

n— 1 n— 1 

5 i = X! ~ Ti/WlTwW + XI lv{k)"f u {k) =: B u + £ 12 say. 
fc=i fe=i 

By applying Theorem 6 of [3] and the fact that the Hermite rank of the 
bivariate function VtVt+k — 'Yv{k) is 2, we obtain sup fc E\gk — r ) v {k)\ < Cn 2h ~ 2 , 
for 1/2 < h < 1, and hence, for H + h <3/2, 

n-1 

E\Bn\ < Cn 2h ~ 2 ]T | 7u (fc)| = 0(n 2H + 2 ^ 3 ) -> 0. 
k=i 

Also, note that lim n i?i2 exists for H + /i < 3/2. These facts and the fact 
that k 2 = 7l /(0) 7u (0) + 21im n i?i2 complete the proof of (a). 

The claim (b) is proved by showing that the conditional distribution of 
U n ,Mi given T, converges weakly to iV(0, Kz). In view of the fact (a), by the 
Lindeberg-Feller theorem, this is equivalent to showing 

(2.12) Pi max \d ni \>8)^0 for all 5 > 0. 

To prove this, recall from [5] that maxi<^< n \Xf\ — Op (In re). By the Gauchy— 
Schwarz (C-S) inequality, we obtain that, for any integer I > 0, 

n 

max \d n j\= max nT 1 ! 2 v t bt-j(I{\t — j\ > I) + I(\t — j\ < I)) 

— M<j<n ' —M<j<n t—\ 



n 



1/2 / „ \ 1/2 



<»- 1/2 (X>?J j^Di-W-M 

n 

+ n~ 1/2 max \u t \ max ^ \h-j\I(\t - j\ < I) 

Kt<n —M<i<n — * 
- J - t=l 

= ( J ( r l+H + n ~l/2f max lv ( x) X H -l/2 
V \{0<a;<lnn} / 

In view of (2.10), this upper bound is o p (l), for any I = 0(n 2 ' ? /( 2 ^ _1 )). Hence, 
(2.12) follows, thereby completing the proof of the lemma. □ 

Now, take v{x) = xa{x) in the above lemma. Because a is an even func- 
tion, the Hermite rank of v(x) is 1. Also, the fact maxi<j< n \X t \ = O p (lnn) 
and (2.6) imply Xe = o p {n~ 1 / 2 ) for H + h< 3/2. Hence, we readily obtain 
the following: 

Theorem 2.2. Suppose (1.1) - (1.3), and Assumption 1 hold. In addi- 
tion, suppose EX 2 o~ 2 {X) < oo and (2.10) holds with v(x) =xa(x). Then, for 
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H + h < 3/2, n x l 2 {fi x - ft) -+ d AT(0,/c 2 /7), where now K 2 = 
lim„ Efc=o £{A a(A ) AV(A fc )}£(n n fc ) • 

An estimate of k 2 is obtained as follows. Because, under Assumption 1, 
\Cov{X a(X )u ,X k a(X k )u k )\ <Ck~ 4+2H+2h , for all sufficiently large k, 
the process Xtcr{Xt)ut is weakly dependent when H + h < 3/2. Thus, one 
may use the block bootstrap method to estimate k 2 here using X t Vt(X t )(Yt — 
$0 — PiXt), 1 < t < n, see [22], where Vt is as in (5.2) below. Although we 
do not prove it here, such an estimator should be consistent for k 2 . 

3. Asymptotic distribution of & 2 {x). In this section we shall investi- 
gate asymptotic distribution of the kernel type estimator a 2 (x) of c 2 (x) in 
regression model (1.5) under the following: 

Assumption 2. £||r(X)|| 2 < oo and A := Er(X)r(X)' is nonsingular. 

An example of r := (ri, . .. ,r q ) satisfying this condition is rj(x) = x J , j = 
l,...,q. 

To define a 2 (x), let K be a density function on [—1,1], b = b n be se- 
quence of positive numbers, <fr denote the density of the iV(0, 1) r.v., (p(x) := 
7 -1 <^((a; - n)/i), and <p n (x) := s"V((^ - A)/s). Let K b (x) = K(x/b)/b, 
Kbt(x) := Kb(x — Xt), and define the kernel type estimator of a 2 (x) to be 

1 n 

a 2 (x) := — Y,K bt (x)el e t := Y t - f3'r(X t ). 

Now fix an x £ M. and consider the following additional assumptions. 

Assumption 3. The density K is symmetric around zero. 

Assumption 4. The function a 2 is twice continuously differentiable in 
a neighborhood of x. 

Assumption 5. The bandwidth b satisfies 6^0, ra 2/l ~ 1 (lnn)~ 1 6 — > oo 
for 1/2 < h < 3/4, n 2 - 2h b -> oo for 3/4 < h< 1. 

To describe our results, we need to introduce Z* 2 := n l ~ 2H Ylt=i ( u t ~ 
1), n ra := Er(X)o~(X). The proof of the following theorem is given in the 
Appendix. In it, Z| is the Z 2 of (2.4) with Bi = B 2 and V := (^i + V|) 1/2 - 

Theorem 3.1. Suppose (1.2), (1.3), (1.5), Assumptions 2, 3 and 5 /io/d 
and Ee 4 < oo. 
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(a) In addition, suppose x±,...,Xk are k>l points at which Assumption 
4 holds and r is continuous, H < (1 + h)/2, and 

(3.1) n 1_/l 6 2 ^0. 



Then, {n 1 h (a 2 (xj) — o~ 2 (xj)),j = l,...,k} converges in distribution to 

(b) In addition, suppose x is a point at which r is continuous and As- 
sumption 4 holds, H > (l + h)/2 and 

(3.2) ra 1_H 6->0. 
Then, 

n 2 - 2H {a 2 {x)-a 2 {x)) 

(3.3) = a 2 (x)Z* 2 + \p' ra A~ 1 r(x)r(x)'A~ fj, ra 

+ y! ra A- 1 r{x)a{x)]Zl 1 + o p (l). 

Moreover, (Z* 2 ,Z nl )^ d (ZlZ) and CoTvel(Z* 2 ,Z 2 ^ - 2H 



ilJ ~ ' lff-1 V 2H-1 

Consistent estimators of ip\ , ip2 are obtained by plugging in the estimators 
of H,G u ,Gx and h in there while that of \i ra is n -1 J2?=i r (Xi)Vi(Xi) . 

Remark 3.1. Suppose we choose b = 0(n~ s ). Then Assumption 5 and 
(3.1) hold, for all 6 in the range (1 — h)/2 < 6 < 2(1 — h) whenever h > 3/4; 
and for all 5 in the range (1 — h)/2 < 6 < 2h — 1, whenever h < 3/4 in 
the case (a). Similarly, in the case (b), Assumption 5 and (3.2) hold for 
1 — H < 5 <2h — 1 whenever h < 3/4; and for 1 — H < 5 <2 — 2h whenever 
h > 3/4. 

We also note here that by using the truncation method as in [2] , the above 
Theorem 3.1 will continue to hold for a symmetric density kernel function K 
with noncompact support and finite variance, for example, normal density. 



4. Estimation of H. In this section we consider the problem of estimat- 
ing G U ,H in the model (1.5) based on i t ■= Y t — (3'r(X t ), 1 < t < n. 

For a process f t ,l <t < n, let w^X) := (2vrn)^ 1 / 2 J2t=i &e itA , I € (A) := 
|u;^(A)| 2 , A G [— 7r, 7r], denote its discrete Fourier transform and periodogram, 
respectively, where i := (— l) 1 / 2 . Fix 1/2 < a\ < a-i < 1. With Aj := 2irj/n 
and an integer m £ [l,n/2), for a\ < tp < a 2 , let 

-, m m 

Q(V0 := -E A f W) =log W) - (2^-l)£logA i . 

771 — * J — ' 

3=1 3=1 
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Then the local Whittle estimators of G u and H in the model (1.1) based on 
{e~t} are defined to be G u = Q(H), H = arg miring r ai a2 i R(tp), respectively. 

The log(n) consistency of an analog of H and consistency of an analog 
of G u in nonparametric homoscedastic regression models with X% = t/n, 
t = 1, . . . , n is proved in [25] . The following theorem shows that these results 
continue to hold in the regression model (1.5) under much simpler restric- 
tions on m than those required in [25], partly due to the parametric nature 
of the model and partly due to random design. 

Theorem 4.1. Suppose, in addition to (1.2), (1.3), (1.5), Assumptions 2 
and 5, the following holds: 

Then, ln(n)(H — H) —> p 0, G — G u -+ p 0. 

The proof of this theorem is sketched in the Appendix. We note here that 
if m = Cn a for an < a < 1, then (4.1) holds for H >h. In the case H < h, 
it holds for any a > (2h — H — l)/(2h — 2H). In particular, in the case of 
Gaussian {ut}'s, [14] shows that the optimal bandwidth m equals Cn 4 / 5 , 
which always satisfies (4.1). 

5. Regression model diagnostics. In this section we investigate the weak 
convergence of V n under Hq and the assumptions (1.2)-(1.3). The following 
Glivenko-Cantelli type result is used repeatedly in this connection: for a 
measurable real valued function g, with E\g(X)\ < oo, 



(5.1) sup 



n 

t=\ 



1 ]T g(X t )I(X t <x)- Eg(X)I(X < x) ^ 0. 
t=\ 

We are now ready to state and prove the following: 

Theorem 5.1. Under (1.2), (1.3), (1.5) and Assumption 2, 

SUp\n~ H V n (x) - J a (x)Z n i \ = O p (l). 

Hence, under Hq, n~ H V n {x) => J a (x) , i^iZ, in T>(M), and uniform metric. 

Proof. Let Z n = Ylt=i r (Xt)atUt, nA n := A n , and for an ie| let 
a(x) := Er(X)I(X < x), L(x) := Ea 2 (X)I(X < x), 



a n {x) :=n- 1 ^r(X t )/(X t <x), F a {x) := Ea{X)I{X < x), 
t=i 

n n 

V„(z) := Y,o t utHX t < x), U n {x) :=J2utWtI(X t <x)- F a (x)}. 



t=i t=i 
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Now, assume Hq holds. Using the Hermite expansion argument, we have 
E(n~ H [U n (x) -U n (y)}) 2 < Cn^-^L^y) - L(x)\, for all x,y £ M. Then 
the chaining argument of [9] yields that sup^gjj \U n {x)\ = o„(l), and hence, 
n~ H V n (x) = F a (x)Z n i + u p {l). By (2.1), we also have n~ Z n = [i T(J Z n \ + 
o p (l), A n = A + o p (l), and by (5.1), sup^ ||a n (a;) - a(x)\\ = o p (l). Note 
also that J a (x) = F a (x) — fjf ra A~ a(x). From these facts we readily obtain 

n 

n- H V n (x) := n- H Y^[Yt ~ P'r(X t )]I(X t < x) 
t=i 

= n~ H V n (x) - n~ H Z' n A~ 1 a n (x) = J a (x)Z nl + u p (l). 
This, uniform continuity of J a and (2.2) complete the proof. □ 

In order to implement the above result, we need a uniformly consistent 
estimator of J a (x). One of the unknown entities in J a is &(X). Because of 
unstable behavior of a(Xt), we shall use an alternate estimator of <r(x) based 
on the ideas of cross validation method that leave one observation out each 
time. For this purpose, we assume the design density is known, that is, ^,7 
are known and take them to be 0, 1 without the loss of generality. Let 

/ 1 n \l/2 

(5.2) A^( x ):=^-—Y, K bt(x)e 2 t j , Vi(x) := A^ i (x)(p~ 1 ^ 2 (x), 

for i = 1, . .. ,n. Note that Vi(x) is an estimator of <j(x) and that of J a {x) is 

n n 

J n {x) = n- 1 ]T Vt(X t )I(X t <x)- n~ x ]T r(X t )Vt(X t )A; 1 a n (x). 
t=i t=i 
To prove its uniform consistency, we need the following: 

Assumption 6. The function a has continuous first derivative. 

Theorem 5.2. Suppose (1.2), (1.3), (1.5), and Assumptions 2 and 6 
hold and that fi = 0,7 = 1. In addition, suppose 6^0, b~ l n 2h ~ 2 = 0(1), 
£||r(2X)|| 2 < 00, Ea 2k (X)rj(X) < 00, for j = l,...,q, k = 0,1, and 

Ea 2 (X)(f) l l 2 (X) < 00. Then, under Hq, sup^gg \J n (x) — J a {x)\ = o p (l). 
The proof of this theorem follows from the following lemma. Let A 2 t (x) = 

Lemma 5.1. Under the conditions of Theorem 5.2, 

(5.3) max E{A 2 _ t (X t ) - a 2 <P(X t )} 2 0, 

l<t<n 

n 

(5.4) n" 1 J2 |A_ t (X t ) - A_t(X t )| V 1/2 (*t) A 0. 

f=i 
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The proof of this lemma appears in the Appendix. We have the following: 



Corollary 5.1. Under the conditions of Theorem 5.2, 
(5.5) max E\A_ t (X t ) - a t (f>^ 2 (X t )\ 4 0, 



KKn 



(5.6) n ^Y,\[V t (X t )-a t ]r,(X t )\^0, j = l,-,q. 

t=i 

Proof. The claim (5.5) follows from (5.3) and the inequality {a 1 / 2 — 
&i/2|2 < | o -6|,oA6>0. We shall prove (5.6) for j = 1 only, it being similar 
for j = 2, . . . , q. It suffices to show that 

n 

(5.7) n- 1 £ \A_ t (X t ) - A_ t (X t )\\ ri (X t )\r 1/2 (X t ) = o p (l), 

t=i 

n 

(5.8) n- 1 J2fi-t(Xt)r 1/2 (Xt)-o-(X t )\\r 1 (X t )\=o p (l). 

t=i 

By the Holder inequality, the expectation of the l.h.s. of (5.8) is bounded 
above by £?=i E^{k^{X t )-a{X t )<t ) 1 l 2 {X t )YE^{rl l \x t ) ( t>^l\X t )} /n. 
Since E\ ri (X)\ 3 / 2 / 4> 3 ^{X) = E\ ri (2X)\ 3 / 2 < oo, (5.8) follows from (5.5). 

Next, by the C-S inequality, the l.h.s. of (5.7) is bounded above by 

( n n 

1 £ |A_t(X t ) - A. t (X t )\ 2 r 1/2 (X t ) ■ Y: r 2 (X t )<p- l/2 (X t ) . 



n 

U=i t=i ) 

But because n" 1 4== ->a . #4P= < CEr\(2X) < oo, (5.7) follows 
from this bound and (5.4). □ 

Proof of Theorem 5.2. The proof follows from (5.6), the triangle 
inequality, the facts that H-A^ 1 — A~ 1 \\ = o p (l), and sup^, ||a n (x) — a(x)|| = 
o p (l) implied by (5.1), in a routine fashion. □ 

A consequence of the above results is that whenever sup z | J a {x)\ ^ 0, the 
test that rejects Ho, whenever, with ipi := G U 9(H)/ H(2H — 1), 

(5.9) D n :=—^- . sup|V n (x)| > z a/2 , 

n u ip\ supg. \J n {x)\ * 

is of the asymptotic size a. Here z a is the 100(1 — a)% percentile of the 
N(Q, 1) d.f. In the simple linear regression model with nonzero intercept, 
that is, when r{x) = (l,x)', J a (x) = if and only if o~{x) is constant in x. 
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In the case of a polynomial regression through the origin, sup^ | J a {x)\ 7^ 0. 
In particular, the above test is applicable when fitting a heteroscedastic 
polynomial. 

Proving consistency of the proposed test against a fixed alternative is 
a delicate matter. However, suppose G U ,H and a are known such that 
sup x I J a {x)\ 7^0. Then the test that rejects Hq whenever sup x |V n (x)| > 
n H Zu^Tpi sup x I Jo- (a;) I will be consistent against the alternative fi(x) = (3'r{x) + 
£(x), for all x, where £ is such that E£ 2 (X) < 00 and 

sup \E[£(X) - E{r{X)£{X))' A- 1 r{X)]I{X < x)\ ^ 0. 

In the case these parameters are unknown, the above test (5.9) will be con- 
sistent against this alternative, provided estimators of these parameters con- 
tinue to be consistent under the given alternative. 

6. Numerical results. This section contains a simulation study and a 
real data application. 

6.1. A simulation study. In this simulation we take r(x) = (1, x)' , (3q = 0, 
fa = 2 and a 2 (x) = 1 + x 2 . The errors {it*} are taken to be FARIMA(0, H — 
1/2,0) with standardized Gaussian innovations and {X{\ is taken to be 
fractional Gaussian noise with the LM parameter h. The values of H, h 
range in the interval [0.6,0.95] with increments of 0.05. These processes 
were generated using the codes given in Chapter 12 of [4]. 

We first concentrate on the properties of Pi and H. Table 1 provides the 
root mean square errors (RMSE) of the LSE 0\ with sample size 500 and 
2000 replications. As can be seen from this table, when H + h increases, so 
does the RMSE of pi- Typically, when H + h< 3/2, the RMSE is small. 

Table 2 provides the RMSE's of the local Whittle estimator H of H based 
on i t = Y t — PiX t , 1 < t < 500, repeated 1000 times. From this table, we 



Table 1 

RMSE of the LSE /3i for sample size n = 500 



H\h 


0.60 


0.65 


0.70 


0.75 


0.80 


0.85 


0.90 


0.95 


0.60 


0.00873 


0.00864 


0.00881 


0.00980 


0.01046 


0.01151 


0.01352 


0.01922 


0.65 


0.00843 


0.00950 


0.01074 


0.01170 


0.01231 


0.01345 


0.01758 


0.02473 


0.70 


0.01039 


0.01011 


0.01141 


0.01354 


0.01463 


0.01760 


0.02150 


0.03410 


0.75 


0.01082 


0.01212 


0.01354 


0.01545 


0.01942 


0.02273 


0.03036 


0.04653 


0.80 


0.01229 


0.01395 


0.01765 


0.01924 


0.02438 


0.03328 


0.04788 


0.07352 


0.85 


0.01407 


0.01776 


0.02180 


0.02828 


0.03621 


0.04875 


0.07040 


0.12540 


0.90 


0.01859 


0.02369 


0.03099 


0.03984 


0.05404 


0.08341 


0.12010 


0.20873 


0.95 


0.02572 


0.03406 


0.05188 


0.06472 


0.11374 


0.17622 


0.27383 


0.49624 
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Table 2 

RMSE of H based on Yt — (3\Xt for sample size n — 500 



H\h 


0.60 


0.65 


0.70 


0.75 


0.80 


0.85 


0.90 


0.95 


0.60 


0.03964 


0.03867 


0.03931 


0.03870 


0.03774 


0.03940 


0.03985 


0.03938 


0.65 


0.04276 


0.04273 


0.04402 


0.04261 


0.04330 


0.04511 


0.04158 


0.04317 


0.70 


0.04808 


0.04750 


0.04770 


0.04819 


0.05041 


0.04856 


0.04746 


0.04690 


0.75 


0.05357 


0.05478 


0.05537 


0.05473 


0.05382 


0.05075 


0.05290 


0.04940 


0.80 


0.06228 


0.06303 


0.06231 


0.05918 


0.05822 


0.05973 


0.05813 


0.05509 


0.85 


0.07076 


0.07202 


0.07075 


0.06699 


0.06584 


0.06310 


0.06217 


0.05773 


0.90 


0.08334 


0.08323 


0.08078 


0.07803 


0.076877 


0.07237 


0.06785 


0.06508 


0.95 


0.11288 


0.11096 


0.10891 


0.10300 


0.09456 


0.08519 


0.07816 


0.06718 



Table 3 

Ranges for S of the bandwidths for estimation a 



H\h 


0.65 


0.75 


0.85 




0.95 


0.65 (a) 


(0.175, 0.3) 


(a) (0.125, 0.5) 


(a) (0.075, 0.3) 


(a) 


(0.025, 0.1) 


0.75 (a) 


(0.175, 0.3) 


(a) (0.125, 0.5) 


(a) (0.075, 0.3) 


(a) 


(0.025, 0.1) 


0.85 (b) (0.15, 0.3) 


(a) (0.125, 0.5) 


(a) (0.075, 0.3) 


(a) 


(0.025, 0.1) 


0.95 (b) (0.05, 0.3) 


(b) (0.05, 0.5) 


(b) (0.05, 0.3) 


(a) 


(0.025, 0.1) 






Table 4 










Summary of ASE(a 2 ) for 


H = 0.65 






h\ Summary 


Bandwidth 


Ql 


Median 


Mean 


Q3 


0.65 


3n- 02 


0.0261 


0.0369 


0.0424 


0.0512 


0.75 


3.5n-°' 2 


0.0256 


0.0383 


0.0432 


0.0557 


0.85 




0.0273 


0.0417 


0.0595 


0.0617 


0.95 


1.5n-°'°" 


0.0366 


0.0663 


0.1138 


0.1058 






Table 5 










Summary of ASE(a 2 ) for 


H = 0.75 






h\ Summary 


Bandwidth 


Ql 


Median 


Mean 


Q3 


0.65 


An- 02 


0.0442 


0.0711 


0.0887 


0.1127 


0.75 


An' - 2 


0.0465 


0.0652 


0.0888 


0.1076 


0.85 


An- - 2 


0.04667 


0.0774 


0.1043 


0.1252 


0.95 


2n-°-°" 


0.0627 


0.0995 


0.2190 


0.1902 



observe that, for H < 0.85, the overall RMSE is less than 0.072 and stable 
regardless of the values of h. 

Next, to assess the finite sample behavior of a 2 , we simulated the es- 
timator <t 2 (x) for the values of x in the grid x\ = —1.50, X2 = —1.49,..., 
x 30i — 1-50, and for 0.65 < H,h< 0.95. We used the built-in smoothing 
function of the R program with normal kernel and sample size 500 repeated 
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Table 6 

Summary of ASE(a 2 ) for H = 0.85 



h\ Summary 


Bandwidth 


Ql 


Median 


Mean 


Q3 


0.65 


4.5n -0 ' 2 


0.1562 


0.2724 


0.5402 


0.5584 


0.75 


6n-°' 2 


0.1594 


0.3113 


0.5449 


0.6330 


0.85 


5n" - 2 


0.1625 


0.3252 


0.5475 


0.6103 


0.95 


2.5n-°°" 


0.1704 


0.3155 


0.7092 


0.6235 



500 times. The ranges for 5 in the bandwidths b = Cn~ are given in Table 3 
according to the Remark 3.1. The symbols (a) and (b) indicate "Case a" and 
"Case b" of Theorem 3.1, respectively. Based on Table 3, for convenience, we 
used (5 = 0.2,6 = Cn~' 2 in our simulations for all cases of H and h considered 
except when h = 0.95. In this case, we used 5 = 0.099. The constant C is ad- 
justed for different values of H and h according to the average squared errors: 
ASE := Y?k=i{p 2 ( x k)/a 2 {xk) ~ 1) 2 /301. We record those C values which 
possibly make ASE the smallest. Some summary statistics of ASE are re- 
ported in Tables 4-7. It can be seen that the estimator a 2 {x) is relatively sta- 
ble for the values of H, h < 0.85. Similar results are observed when we replace 
the normal kernel by the kernel function K(x) = 0.5(1 + cos(x7r))/(|x| < 1) 
or the uniform kernel. 

6.2. Application to a foreign exchange data set. In this section we shall 
apply the above proposed lack-of-fit test to fit a simple linear regression 
model with heteroscedastic errors to some currency exchange rate data ob- 
tained from www.federalreserve.gov/releases/H10/hist/. The data are noon 
buying rates in New York for cable transfers payable in foreign currencies. 
We use the currency exchange rates of the United Kingdom Pounds (UK£) 
vs. US$ and the Switzerland Franc (SZF) vs. US$ from January 4, 1971 to 
December 2, 2005. We first delete missing values and obtain 437 monthly 
observations. The symbols X = dlUK and Y = dlSZ stand for differenced 
log exchange rate of UKJCvs. US$ and SZF vs. US$, respectively. We obtain 

mean(dlUK) =-0.0001775461, Stdev(dlUK) =0.001701488, 



Table 7 

Summary of ASE(a 2 ) for H = 0.95 



h\ Summary 


Bandwidth 


Ql 


Median 


Mean 


Q3 


0.65 


6n-°- 2 


1.153 


3.214 


16.24 


11.83 


0.75 


7n-°- 2 


1.137 


3.078 


14.75 


11.25 


0.85 


7.5n-°' 2 


1.018 


2.611 


12.77 


11.59 


0.95 


4.5n-°-°" 


1.136 


3.374 


12.57 


11.85 
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-2 2 

QO-plol ol dlUK and simulated FGN(H_X) 



Fig. 1. QQ-plot of dlUK. 

mean(dlSZ) =-0.00004525129, Stdev(dlSZ) =0.001246904. 

The local Whittle estimates of the LM parameters of dlUK and dlSZ 
processes, respectively, are 0.6610273 and 0.7147475. In computing these 
estimates we used m= [n/8] =54. 

Comparing the X-process with a simulated fractional Gaussian noise with 
h = 0.6610273 and n = 437, Figure 1 suggests that the marginal distribution 
of X is Gaussian. 

Next, we regress Y on X, using the normal density kernel regression func- 
tion estimator and a simple linear regression model. Both of these estimates 
are depicted in Figure 2. They display a negative association between X and 
Y. The estimated linear equation is Y = -0.000118775 - 0.4141107X, with 
a residual standard error of 0.00102992. 

Figure 3 provides the nonparametric kernel estimator of cr(x) when re- 
gressing Y on X with K(x) = 0.5(1 + cos(x7r))/(|x| < 1). 

The estimators of H based on e = Y — (3X and u = (Y — fiX) /&(X) are 
equal to 0.6046235 and 0.6246576, respectively. This again suggests the pres- 
ence of long memory in the error process. 

Finally, to check if the regression of Y on X is simple linear, we obtain 
D n = 0.4137897 with the asymptotic p- value 66%. As expected, this test fails 
to reject the null hypothesis that there exists a linear relationship between 
these two processes. 



HETEROSCEDASTIC REGRESSION AND LONG MEMORY 19 
APPENDIX 

This section contains some preliminaries and proofs. To begin with we give 
a reduction principle involving the kernel function K b . Let Q := {y : Ev 2 {X) < 
oo}, /x = EX, 7 2 = Vai(X) and Z=(X-fj) /j. Then Hermite expansion of a 
v € Q is equal to J2j>o( c j I 'j^-)Hj(Z), where now Cj = Evi^yZ + fx)Hj(Z). We 
also need the fact that for any auto-covariance function c(k) ~ Ck~ 2 ( l ~ 5 \ 
k^oo, 1/2 < 5 < 1, 

1\ /log_n 
n j \ n 



"- 2 EE c2 (l*- s D = °(r) +o(^)/(« = 3/4) 



t=l s^t 

(A.l) 

+ 0(n 4 ^ 4 )/(3/4<,5<l). 
We are now ready to state and prove the following: 

Lemma A.l. Let Xt,t€Z, be a stationary Gaussian process with [i = 
EX and j 2 = Var(X). Let £ be a real valued measurable function on M, K be 
a density kernel on M, and b = b n be a sequence of positive numbers, b^ 0. 
In addition, suppose x E R is such that 



(A. 2) sup / K 2 (v)i (x - bv)(p(x - bv) dv < oo. 

n>l J 

Let fi b (x) = EK b (x - X)£(X). Then, 

n- 1 £ (K bt (xnX t ) - » b (x)) - i(x) X -^J X -^-)n- 1 £ 

t=i 7 V 7 / i=1 \ 7 , 




Fig. 2. Kernel estimation ofr(x). 
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1 1 
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Standard deviati on f unci ion or SZ'UK 
Fig. 3. Kernel estimation of a(x) 



O r 



O r 



Or 



1 



nb 
1 



nb 
1 



+ O z 



nb 



1/2 



+ 



nb \/bn A - Ah J' 



l/2</i<3/4, 

h = 3/4, 
3/4</t<l. 



Proof. Without loss of generality, assume fj, = 0,7 = 1. Let x be as in 
(A.2). Let v n {X) := Vb[K b (x - X)£(X) - /z 6 (x)]. For each n > 1, Eu n (X) = 
0, and < bEK({x-X)i 2 {X) = f K 2 (v)£ 2 (x-bv)ip(x-bv) dv. Hence, 

under (A.2), sup{£^(X),n > 1} < oo, so that v n {X) £ G, Vn > 1. This in 
turn implies that sup n>1 J2j^=i c nj/j* < °°> where, Vj > 1, 

c nj -=^bj K b(x ~ y)^(y)H j (y)i P (y)dy = Vb{Hj{x)i{x)ip{x) + o(l)}. 



Hence, 



{-i " ~j 2 n n00 2 



<E 

i=2 



3 r i i 



+ ^EE^(i*-i 

t=l s^t 



This and (A.l) applied to c(k) =jx(k) complete the proof. □ 
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We also need to recall the following result from [11]. Suppose {u t } is as 
in (1.2) with Ee 4 < oo. Then, for all 1/2 < H < 1, 

(A.3) E(u 2 -l)(u 2 -l) = 2D 2 t 2 ^ 2H ^ + o(t 2 ( 2H ^), t^oo, 

(A. 4) Euou s (u 2 - 1) ~ 2j u (t)-f u (t - s), \t — s\— >oo, 

and for s,t,r such that \t — s\, \r — t\ and \s — r\ all tending to infinity, 

(A.5) Euou s utu r ~ 7 11 (s)7„(r - t) + -f u (t)^ u (r - s) + 7«(r)7 u (t - s). 

Use (A.3)-(A.4), and argue as in the proof of Lemma A.l with £ as in 
there, to obtain the following two facts under Assumptions 3 and 5, and 
(A.2): 

(A.6) E^n-^(K M (x)C(X t ) - n)u t \ =o(i)+ (-^), 

(A.7) E^j2{K u {x)i{X t )-v b ){u 2 -l)^ =o(l) + (— L^. 

Note that if K is supported on [—1, 1] and £ is continuous at x, then (A.2) 
holds, for in this case the l.h.s. of (A.2) is bounded above by Cmax| 2 | <fe ^ 2 (x — 
z) — ► C£ 2 (:c) < oo. It also holds if £ is bounded and if is square integrable 
on R. In particular, under Assumptions 3 and 4 and continuity of r at x, it 
holds for £(y) = cr 2 (?/) and £(y) = a(y)r(y), y G R. 

Next, we give some inequalities that are useful in approximating an av- 
erage of certain covariances of a square integrable function of a Gaussian 
vector by the corresponding average where the components of the Gaussian 
vector are i.i.d. Accordingly, let E2 denote the expectation of a standard 
/c-dimensional normal random vector. Let ^4o,s,t be the covariance matrix of 
X ,X s ,X t , 0<s<t, and B , s ,t = M,s,t —h = {{hj{s,t))), where h is the 
3x3 identity matrix. Let g s j denote the largest eigen value of Bo ;S ,t- From 
[23], Chapter 6.2, page 194, we obtain that g St t < maxj^ =1 \ bij(s,t)\. This 
in turn implies that 

(A.8) S) t<3|7x(i-s)|V|7x(s)|V| 7 x(i)| Vs<t,s,teZ. 

For a square integrable function g of k r.v.'s, let \\g\\1 '■= (E^g 2 ) 1 / 2 and 
73(72) denote the Hermite rank of g(Xo, X\, X2) — Eg(Xo, X±, X2) (g(Xo,Xi) — 
Eg(X ,Xi)). Since both r 3 Ar 2 > 1, Theorem 2.1 of [27] yields that, for suf- 
ficiently large \s — i\, \t — i\ and \t — s|, 3 a C < 00 free of i,s,t, such that 

I Eg(Xi ,X„X t )- E°g(Xi ,X„X t )\ 

(A.9) 

< C|| 5 ||°^ MM < Chx(t - s)\ V \ lx (s -i)\ V | 7 x(t -*)|. 



(d\x) - a\x)) + ^L[d'Z n d-2d'S n ] + 
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\Eg(X s ,X t )-E°g(X s ,X t )\ 

(A.10) 

<C\\g\\hx /2 (t-s)- 
In turn, (A. 9) implies that, uniformly in i = 1,.. . , n, 

(A.ll) (n- l)- 2 J2T,( E 9(X t ,X s ,X t ) - E^iX^Xs^t)) ^0. 

Proof of Theorem 3.1. Let x be a point at which r is continuous 
and Assumption 4 holds. Let d := ((3-/3), a 2 (x) := J2t=i Kbt(x)a 2 u t/ ni f( x )i 

~y n 1 n 

S n ■= r^^2 K bt( x ) a trtu t , S n := — y^K bt {x)r t r' t . 

n<p{x) nip{x) ^ 

Rewrite <r 2 (x) — a 2 (x) as the sum 

"( x ) J -2/ \ 
-r-T - 1 <T (X) 

=: I* + II* + III* . 
Now let ffe := EK h (x — X)a 2 (X) and rewrite I* = I + /J, where 

1 ™ 

n</?(x) ^ 

1 n 

First consider the term //. Use Assumption 4 to verify that 
(A.12) II 2 < Cb 2 . 

Assumption 5 implies that n 2 ~ 2h = o{nb). By Lemma A.l applied to £(y) = 
cr 2 (y), we obtain that, under Assumptions 3-5, 

(A.13) n 1 - h IIi=j- 1 (x-n)a 2 (x)Z nl + Op(l) Vl/2<h<l. 

Now consider the term /. Because EI = 0, (A.6)-(A.7) applied to £(y) = 
a 2 {y) yield that EI 2 = O^nb)" 1 ), for 1/2 < H < 3/4, EI 2 = 0((n6)^ 1 + 
n 4H - 4 ), for 3/4 < H < 1, and I = O p ({nb)~ l l 2 ln 1 / 2 ^)), for # = 3/4. We 
summarize these results here: Under Assumptions 2-5, and Ee A < oo, 

I = 0p (-L + n 2H ~ 2 I(H > 3/4) + = 3/4)) , 

\ynb ynb J 

(A.14) 

// = O p (n /l - 1 + 6 2 ). 
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Next, consider S n . By (A. 3) with £(y) = a(y)rj(y), j = 1, . . . ,q, we obtain 

n 

(A.15) n l ~ H S n = r(x)a(x)n~ H J2 u t + 

t=i 

Note also that E(E n ) — > r(x)r'(x). 

Lemma A.l applied to £(y) = (r(y)r'(y)) i:j , i,j = l,...,q, yields 

(A.16) S n - J B(S n ) = O p (n^ 1 ). 

To deal with the III* term, by (A.l) applied with c{k) = 7x (fc), we have 

s 2 ~ 7 2 = 1 - - 7 2 ] -(X- = o p (n h ^). 

This and the identity s — 7 = (s 2 — 7 2 ) /(s + 7) yield that s — 7 = o p (n /l_1 ). 
In turn, this fact, the continuity of (p and the Taylor expansion yield that 

(A.17) (rfEL -l)= X -ZJL n ^ h {X - M )/ 7 + o p (l). 

PROOF of (a). Here H < (l + h)/2. From (A. 12) and (A. 14), one sees that 
in this case n l ~ h I = o p (l) = n l ~ h Il2- Hence, by (A. 13) and Assumption 5, 

n l - h I* = ^^a\x)Z nl + o p {\). 

7 

By (A.15) and (A.16), II* is negligible, because 

II* = O p (n 2 ( H -V + n VH-2)+(h-i) + n 2(ff-i)) = ^(nfc-i). 

These results, (A.17) and the fact that <7 2 (x) — > p <t(x) yield 

(A.18) n 1 " V(x) - a 2 (x)) = ^~V(x)(Z nl + Z n2 ) + 0p (l), 

7 

where now Z n2 = n~ J2t=l(-Xt — m)/t- By the independence of X 4 's and 
Ut's, Z n i and Z n2 are independent. Clearly, under the assumed conditions, 
(A.18) holds for each xi,...,Xf. given in part (a). Hence, part (a) follows 
from (2.2) and the Cramer-Wold device. 

Proof of (b). In this case, 2H - 2 > h - 1. Let a = 2- 2H . Then, by 
(A.17), n a III* =Op(l). By (2.1) applied with z/(x) = r(x)a(x), 

n 

(A.19) n l ~ H d = n~ H A~ l ^ r t a t n t = 4u m Z n i + o p {l). 

t=i 

By (A.16), because 2H - 2 > h- 1, 

n 2 - 2H d'^ n d = n 2 - 2H d'[r(x)r(x)']d + o p (l) 

= f/ ra A~ 1 [r{x)r(x)']A~ 1 ii ra Z^ 1 + o p (l). 
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By (A.15) and (A.19), n 2 ~ 2H d'S n = fi' ra A^ 1 r(x)a(x)Z 2 1 + o p (l). Since <p n (x) 
— >p tp{x) > 0, we thus obtain 



11 



2-2H 



II* = f i' ra A- 1 r(x)[r(xyA- 1 fi ra + a(x)}Zl 1 +o p (l). 



n 



v \ 

Next, consider I* = I + II . Because h>l/2, H > (1 + h)/2 implies that 
H > 3/4. Hence, here the term / is the dominating term. Recall Z* 2 = 
n \~2H 'Y^ =1 {u 2 - 1). By (A. 7) applied with £(y)=a 2 (y), we obtain 

2-2H T * 2-2H T , /i\ I'd 7 * 

n I =n I + o p (l) = ^-jZ n2 + o p (l). 

Upon combining these results, we obtain (3.3) in the case (b). The claim 
(Z* 2 , Z n i) ^ d (Z^tpiZ) is proved as in [28]. 

To prove the claim about Correl(Z* 2 , Z 2 -^), note that E^n 1- ^ u) 2 — > ip 2 , 
and by (A. 5), 

j n—l n—t n—t n—t 

E (u) 4 = ^4 J2 zZ Eu u Sl u S2 u S3 

t=l Si=lS2 = lS3=l 
^ n—l n—t n—t n—t 

7aZ~2zZzZ z2^(sih u (s3-s 2 ) 

t=l Si = l S2=l S3 = l 

+ 7u(s 2 hu(si ~ S 3 ) + lu(s 3 )j u (s 2 ~ Si)} 

t=l Sl = l v ' 

Hence, Var(u 2 ) ~ 2ip 2 n~ A+iH . By (A. 3) and (A. 4), similar calculations yield 

^"^'-") >^«-.r ' 

This completes the proof of Theorem 3.1. □ 

Next, to prove Theorem 4.1, we need the following preliminaries. Let £ 
be an arbitrary function such that E£(X) = 0, E£ 2 (X) = 1. Let & := f 
and Cj := E^(jZ + fj,)Hj(Z), j > 1. Let := 7x(^)/7 2 and r > 1 be the 
Hermite rank of £(X). Then, the auto-covariance function of the process 



2 

£ is 7£ (A;) = , yx(k) T (^f + X^L r (j+i)! (ky +1 ~ T ) , where the second term is 
bounded above by 2~Zj> i c j/j' = 1- Therefore, there exists a constant C = 
C(t,Gx) free of fc, such that 

(A.20) ^{k)~Cp T k , k^oo. 



„2 
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Lemma A. 2. With £t as above and ut as in (1.2), let denote the 
periodogram of£tUt- Then, as A — > 0, 

FT (\\ = f°( XT{2 ~ 2h)+l ~ 2H )i 0<T(2-2h) + (2-2H)<l; 
lO(logn), t(2-2/i) + (2-2H)>1. 

Proof. From (v. 2.1), page 186 of [35] we obtain that, for any < a < 1, 
n / \ 

(A.21) ^t- a e iiA ^A a ~ 1 r(l-a)(sin|a + icos|a), A -► 0. 
t=i V 2 2 / 

Since 7li (jfe) ~ Ck 2H ~ 2 , by (1.4), (A.20), (A.21) with < a = r(2 - 2h) + 
(2 — 2i/) < 1 and the independence of & and 14, imply that, as A — ► 0, 

n n 

EhuW = ^ E E T?(i " fcK(3 " fcje 1 ^ 
j=ifc=i 



n fe— 1 

Cn -1 V ^ r [r(2-2/ l )+(2-2if)] e iU< Q^r{2-2h)+l- 
k=l t=k—n 



2H 



In the case r(2 - 2/i) + (2 - 2if) > 1, EI^ U (X) < Clogn, for all A G [-7r,vr]. 
□ 

The following fact proved in [7] is also needed for the proof of Theorem 4.1. 
Under (1.2) and when m = o(n), 



(A. 22) sup [vm] 1 y~] I u (\j)/ fj — > 1 a.s. m — ► oo. 

o<d<i j=1 

Proof of Theorem 4.1. The basic proof is the same as in [25], with 
some difference in technical details. So we shall be brief, indicating only the 
main differences. With o"o = Ea(X),ro := Er(X), let n t := et — croUt, £ := 
(J3 - P)'ro and ( t := (/3 - /3)V(X i ) . Then e t = ^Ct + »7t + ^o«t- Let = \)~ 2H 
and =: [ig(Aj) — <7Q/ u (Aj )]//,•. According to the proof of Theorem 3 in 
[25], to prove Theorem 4.1 for 1/2 < H < 1, it suffices to verify the following 
three claims: 



m— 1 



(A.23) J2 



8=1 



7 

m 

m— 1 



2(oi-H)+l 1 



(A.24) (logn) 2 £ - 



i=i 



1-2(5 



E^ 

i=i 



0. 



for some small 5 > 0, 
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(A . 25) flS=£5>ia 

j=i 

We use the following elementary inequalities in verifying these conditions: 
|/g(A) - a 2 I u (X)\ < 2a \I u (\)I v (X)\ 1/2 + IvW, 

(A.26) 

I v (X)<3(I i (X) + I ( (X) + I v (X)) VAG [-vr,vr], 

where Vt := £ + (t + Vt- Let Iy,j '■= Iy(Xj) for any process Yt. 

Recall that for the Dirichlet kernel D k {\) := £t=i e itx , \D k (X)\ < C/X, for 
all AG [-ir,n],k> 1. Also by (2.2) and (A. 19), 

(A.27) n 1 - H \\p-P\\=O p {l). 

These bounds imply that -j 2 - = O p (n 2H ~ 3 X 2H ~ 3 ), uniformly in 1 < j < m. 

Now, consider the terms Q- Assumption 2 implies that the Hermite ranks 
of rj(X) — roj,j = 1,2, . . . ,q, are at least one. Hence, by (A.26), (A.27) and 
(1.4), we obtain that, uniformly for 1 < j < m, 

(A.28) kA = 0p {X 2 l' H - h) n H - 1 ). 

Jj 

Similarly, using Lemma A. 2 and the fact that the Hermite rank r of 
(j(X) — <jq is at least 1, we obtain, uniformly for 1 < j < m, 



(A.29) 



hhi = { P {X] (2 ' 2h) ), < r(2 - 2h) + (2 - 2H) < 1; 

fj [OpiX^-Hogn), r(2 - 2/i) + (2 - 2tf) > 1. 



Now we are ready to verify (A.23)-(A.25). Let an '■= 2(a\ — H). By 
changing the order of summation, the l.h.s. of (A. 23) is bounded above 
by Cm-^^YIjLij^Pjl for H > oi, and by Cm" 1 logm^i Pjl for 
H = a\. But, by (A.26) and the C-S inequality, 



m m I T T 1 1/2 i T 

E r H m<cj2j aH ol u ' j V ' jl + v > 



>3 

' ~ T 

3=1 3=1 h 



<c 



/ m it I m I j I \ 1/2 m \ T \ 

E f H ]J ¥- E-T* ^ + E ^ 2QH ^ 
^■=i j=i /j / j=\ jj 



-a H -l- 



We also have the following facts where c m := m 

m j m j m j 

E f H ¥ = E f m ¥ = E ¥ = °^ c ™)- 

j=l Jj j=l Jj j=l Jj 

These bounds together with (A.22), (A.28) and (A.29) imply (A.23) for 
H > a\. The proof of (A.23) for H = a\ is similar. The conditions (A. 24) 
and (A. 25) are verified similarly. □ 
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Proof of Lemma 5.1. To prove (5.3), it suffices to show 

(A.30) ^j(-±_J2K b3 (X t )a](u]-l)\ - 0, 



\ 2 

-►0. 



/ 1 

(A.31) max E — j ]T K bj (X t )a 2 - a 2 (p{X t 

To prove (A.30), the expectation in the l.h.s. of (A.30) equals A n t + B n t, 
where A n , t := (n - l)" 2 E{K^(X t )a^}E{^ - l) 2 , and 



n n 

B n,t=--J—^Y. E E{K bj (X t )K bk (X t )a]4} 

'i 

xE(u 2 -l)(u 2 k -l). 



(n - l) 2 



Apply (A.9) and (A.ll) to g(X t , Xj, X k ) = K bj (X t )K bk (X t )a 2 a 2 k . Note that 
for this g, \\g\\s < C6 _1 . Hence, uniformly in t, 
C 



B n ,t\ < 



(n-1) 5 



n n 

><E E {^6iTO^6fcTOof^ 



(A.32) 

+ C6- 1 ^. |f|t _ fc| }|S(uJ-l)(u2-l)| 

by (A. 3). Similarly, by (A. 10), we obtain that, uniformly in t, A n ^ < C(n6) _1 . 
Hence, (A.30) holds. To prove (A.31), rewrite the l.h.s. of (A.31) as 



£(n-l)- 2 K bj {X t )K bk (X t )a]a 



- 2E{n-l)' l a 2 {X t ) ( t ) {X t )YK b] (X t )a 2 J +Ea\X)^(X) 

i¥* 

=: C n>t - 2D n>t + Ea^X^iX), say. 

Similar to the argument in (A.32), by (A.9) and (A. 10), the terms C n j and 
D n j tend to Ea 4 (X)c/) 2 (X) uniformly in t, thereby proving (A.31). This 
completes the proof of (5.3). 

Next consider (5.4). Arguing as for (5.5), it suffices to show that 



n 



(A.33) n~ 1 Y.\ k ^ X t)-^-t(Xt)\r ll \X t )^. 



t=i 
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But \k 2 _ t (X t ) -A 2 _ t (X t )\ is equal to 
1 



n 



- K bi (X t )[((3 - P)'r(Xi)] 2 + 2(13 - /3)'r(*i)<W 



Moreover, (A. 10) implies that, for k = 0, 1, 2, 1 < j < q, 



E 



1 



n(n — 1) 



Y J Y, K bi( x t)^ k J (x t )^ l / 2 {x t 



< 



c 



t=l i^t 

n n 



< 



n(n- 1) fr[ 
C 



lb lb n n 

I)EE J J K b (x-y)a(y)\r j (y)\ k ( f>- 1/2 (x)^ t (x,y)dxdy 



n n 



n(n — 1) 



lb IV n n 

I)EE j j K b {x-y)a{y)\r j {y)\ k ^ 2 {x)^y)dxdy 



t=i i^t 



+ 



C 



n l-h h l/2 
n n 



o(i), 



t=l i^t 



< 



c 



n n „ „ pi 

I)EE J J K b(x~ y)\rj(y)\ k 4> 1/2 (x)(f>(y) dxdy + ^h^j2 



n(n-l)^^ 
= 0(1). 

In the above we used the assumptions Erj(X) < oo, j = 1, . . . , q. Therefore, 
(A.33) holds because \\/3 - f3\\ 0. □ 

Acknowledgments. The authors are grateful to the two referees for their 
constructive comments and to the Co-Editor Morris Eaton for his patience. 
They are especially grateful to the Associate Editor of this paper whose 
diligent and concerted effort helped to improve the presentation of the paper. 

REFERENCES 

[1] An, H. Z. and Cheng, B. (1991). A Kolmogorov-Smirnov type statistic with appli- 
cation to test for nonlinearity in time series. Internat. Statist. Rev. 59 287-307. 

[2] Andrews, D. W. K. (1995). Nonparametric kernel estimation for semiparametric 
models. Econometric Theory 11 560-596. MR1349935 

[3] Arcones, M. (1994). Limit theorems for nonlinear functionals of a stationary Gaus- 
sian sequence of vectors. Ann. Probab. 22 2242-2274. MR1331224 

[4] Beran, J. (1994). Statistics for Long-Memory Processes. Chapman and Hall, New 
York. MR1304490 

[5] Berman, S. (1992). Sojourns and Extremes of Stochastic Processes. Wadsworth and 
Brooks/Cole, Pacific Grove, CA. MR1126464 



HETEROSCEDASTIC REGRESSION AND LONG MEMORY 



29 



[6] Dahlhaus, R. (1995). Efficient location and regression estimation for long range 
dependent regression models. Ann. Statist. 23 1029-1047. MR1345213 

[7] Dalla, V., Giraitis, L. and Hidalgo, J. (2006). Consistent estimation of the 
memory parameter for nonlinear time series. J. Time Ser. Anal. 27 211-251. 
MR2235845 

[8] Davydov, J. A. (1970). The invariance principle for stationary processes. Theory 

Probab. Appl. 15 487-498. MR0283872 
[9] Dehling, H. and Taqqu, M. S. (1989). The empirical process of some long-range 

dependent sequences with an application to [/-statistics. Ann. Statist. 17 1767- 

1783. MR1026312 

[10] Fox, R. and Taqqu, M. (1987). Multiple stochastic integrals with dependent inte- 
grators. J. Multivariate Anal. 21 105-127. MR0877845 

[11] Guo, H. and Koul, H. L. (2007). Nonparametric regression with heteroscedastic 
long memory errors. J. Statist. Plann. Inference 137 379-404. MR2298945 

[12] Giraitis, L., Koul, H. L. and Surgailis, D. (1996). Asymptotic normality of 
regression estimators with long memory errors. Statist. Probab. Lett. 29 317- 
335. MR1409327 

[13] Hart, J. D. (1997). Nonparametric Smoothing and Lack-of-Fit Tests. Springer, New 
York. MR1461272 

[14] Henry, M. and Robinson, P. (1995). Bandwidth choice in Gaussian semiparametric 
estimation of long range dependence. Athens Conference on Applied Probability 
and Time Series Analysis II 220-232. Lecture Notes in Statist. 115. Springer, 
New York. MR1466748 

[15] Hidalgo, J. and Robinson, P. (2002). Adapting to unknown disturbance autocorre- 
lation in regression with long memory. Econometrica 70 1545-1581. MR1929978 

[16] Ho, H. and Hsu, N. (2005). Polynomial trend regression with long memory errors. 
J. Time Ser. Anal. 26 323-354. MR2163286 

[17] Koul, H. L. (1992). M-estimators in linear models with long range dependent errors. 
Statist. Probab. Lett. 14 153-164. MR1173413 

[18] Koul, H. L. and Mukherjee, K. (1993). Asymptotics of R-, MD- and LAD- 
estimators in linear regression models with long range dependent errors. Probab. 
Theory Related Fields 95 535-553. MR1217450 

[19] Koul, H. L. (1996). Asymptotics of M-estimators in non-linear regression with long- 
range dependent errors. Athens Conference on Applied Probability and Time 
Series Analysis II. Lecture Notes in Statist. 115 272-290. Springer, New York. 
MR1466752 

[20] Koul, H. L., Baillie, R. and Surgailis, D. (2004). Regression model fitting with 
a long memory covariate process. Econometric Theory 20 485-512. MR2061725 

[21] Koul, H. L. and Stute, W. (1999). Nonparametric model checks for time series. 
Ann. Statist. 27 204-237. MR1701108 

[22] Lahiri, S.N. (2003). Reampling Methods for Dependent Data. Springer, New York. 
MR2001447 

[23] Luenberger, D. G. (1979). Introduction to Dynamical Systems: Theory and Appli- 
cations. Wiley, New York. 

[24] Robinson, P. M. (1995). Gaussian semiparametric estimation of long range depen- 
dence. Ann. Statist. 23 1630-1661. MR1370301 

[25] Robinson, P. M. (1997). Large-sample inference for nonparametric regression with 
dependent errors. Ann. Statist. 25 2054-2083. MR1474083 

[26] Robinson, P. and Hidalgo, J. (1997). Time series regression with long-range de- 
pendence. Ann. Statist. 25 77-104. MR1429918 



30 



H. GUO AND H. L. KOUL 



[27] Soulier, P. (2001). Moment bounds and central limit theorem for functions of Gaus- 
sian vectors. Statist. Probab. Lett. 54 193-203. MR1858634 

[28] Surgailis, D. (1982). Zones of attraction of self-similar multiple integrals. Lithuanian 
Math. J. 22 327-340. MR0684472 

[29] Stute, W. (1997). Nonparametric model checks for regression. Ann. Statist. 25 613- 
641. MR1439316 

[30] Stute, W., Thies, S. and Zhu, L. X. (1998). Model checks for regression: An 
innovation process approach. Ann. Statist. 26 1916-1934. MR1673284 

[31] Stute, W. and Zhu, L. X. (2002). Model checks for generalized linear models. Scand. 
J. Statist. 29 333-354. MR1925573 

[32] Taqqu, M. (1975). Weak convergence to fractional Brownian motion and to Rosen- 
blatt process. Z. Wahrsch. Verw. Gebiete 31 287-302. MR0400329 

[33] Yajima, Y. (1988). On estimation of a regression model with long-memory stationary 
errors. Ann. Statist. 16 791-807. MR0947579 

[34] Yajima, Y. (1991). Asymptotic properties of the LSE in a regression model with 
long-memory stationary errors. Ann. Statist. 19 158-177. MR1091844 

[35] Zygmund, A. (2002). Trigonometric Series, 3rd ed. Cambridge Univ. Press. 
MR1963498 

Department of Statistics and Probability 

Michigan State University 

East Lansing, Michigan 48824-1027 

USA 

E-MAIL: guohongwOstt.msu.cdu 
koulOstt . msu. cdu 



